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Objective: The objective of this study was to explore the association of longitudinal CA-1 25 measurements with overall survival (OS) 
time by developing a flexible model for patient-specific CA-1 25 profiles, and to provide a simple and reliable prediction of OS. 
Methods: A retrospective study was performed on 275 patients with ovarian cancer who underwent at least one cycle of primary 
chemotherapy in our institute. Serial measurements of patients' CA-1 25 levels were performed at different frequencies according 
to their clinical plans. A statistical model coupling the Cox proportional hazards and the mixed-effects models was applied to 
determine the association of OS with patient-specific longitudinal CA-1 25 values. Stage and residual tumor size were additional 
variables included in the analysis. 

Results: A total of 1,601 values of CA-1 25 were included. Longitudinal CA-1 25 levels, stage, and the residual tumor size were all 
significantly associated with OS. A patient-specific survival probability could be calculated. Validation showed that, in average, 
85.4% patients were correctly predicted to have a high or low risk of death at a given time point. Comparison with a traditional 
model using CA-1 25 half-life and time to reach CA-1 25 nadir showed that the longitudinal CA-1 25 model had an improved 
predicative value. 

Conclusion: Longitudinal CA-1 25 values, measured from the diagnosis of ovarian cancer to the completion of primary 
chemotherapy, could be used to reliably predict OS after adjusting for the stage and residual tumor disease. This model could be 
potentially useful in clinical counseling of patients with ovarian cancer. 
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INTRODUCTION 

Among gynecologic cancers, ovarian cancer is the second 
leading cause of death worldwide and the leading cause in 
developed countries [1]. In patients diagnosed with advanced 
stage IIHV ovarian cancer (International Federation of Gyne- 
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cology and Obstetrics [FIGO] classification), the 5-year survival 
rate is -30%, whereas in those diagnosed at an early stage 
the 5-year survival rate is -90% [2]. The overall survival (OS) of 
patients with ovarian cancer has been extensively researched 
and many risk factors have been explored, including the 
serum cancer antigen 1 25 (CA-1 25) levels. 

CA-1 25 is the most commonly used biomarker for diagnosis 
and follow-up of ovarian cancer. It is a high molecular weight 
glycoprotein with an elevated serum level (>35 U/mL) in 50- 
90% of patients with ovarian cancer, depending on the cancer 
stage [3]. Many features of CA-1 25 have been explored for an 
association with OS, such as the absolute value of CA-1 25 at 
different time points during treatment, and pharmacokinetic 
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parameters, including CA-125 half-life [4]. However, the major- 
ity of survival studies have only considered a single CA-125 
measurement or a limited number at well-defined time points. 
However, patients usually have CA-125 measurements taken 
repeatedly (longitudinally) over time, and the number of mea- 
surements, as well as the intervals between measurements, 
vary between patients for different reasons such as different 
responses to chemotherapy or unexpected loss to follow-up 
[5]. Therefore, longitudinal and often misaligned measure- 
ments contain information that cannot be represented by 
single values or characteristics, and it would be beneficial 
to develop a model that takes into account all longitudinal 
measurements to give a clear indication of patient-specific 
CA-125 profiles for the analyses of an association with OS. 

Here, a recently published joint model, which couples the 
Cox proportional hazards and the mixed-effects models, was 
applied to assess the relationship between longitudinal CA- 
1 25 and OS [6]. The aim was to flexibly analyze repeated CA- 
125 measurements over time, including misaligned observa- 
tions of CA-1 25, to provide a simple and reliable OS prediction 
model in a patient-specific manner. FIGO stage and residual 
tumor size were included as variables in the model because 
they are important prognostic factors [2,7]. 

MATERIALS AND METHODS 
1. Data collection 

A retrospective chart review was performed in the Kaohsiung 
Veterans' General Hospital (KSVGH), a public teaching medical 
center in south Taiwan. Data were collected from 275 patients 
who were admitted to KSVGH with ovarian cancer between 
June 1 995 and January 201 2. A total of 6 cycles of primary 
chemotherapy with an interval of 3-4 weeks was planned for 
the patients, with all completing at least one cycle. At each 
cycle of chemotherapy, as well as before and after surgery, at 
least one measurement of the CA-125 serum level was taken. 
A complete list of CA-125 has at least 8 values, spanning 4-6 
months. However, patients may often suffer from adverse 
events after chemotherapy, resulting in treatment interruptions 
and postponement of the next cycle of chemotherapy. Some 
patients completed >6 cycles, but their CA-1 25 values beyond 
the primary chemotherapy were not included. In addition, 
some patients may have fewer cycles of chemotherapy due 
to various reasons such as disease progression or informed 
personal decision. FIGO stage and residual tumor size were 
included as covariates. The disease stage was obtained from 
the pathologic report of each patient, and the residual tumor 
size was obtained from their surgical notes. 



The acquisition of patient data was approved of by the Insti- 
tutional Review Board (IRB) of KSVGH. All patient information 
was retrieved from the hospital records, including electronic 
data and chart data. The extracted data included demographic 
and prognostic information such as age, blood tests, surgical 
notes, and pathologic reports. 

2. Definitions 

The end point of interest was OS, defined as the time from 
surgery to death or last follow-up. Those whose end point was 
the date of the last follow-up were defined as censored. Predic- 
tion of OS was achieved by calculating the survival probability 
of a patient at any given time point. A patient was defined as 
being at a high risk of death at a specific time point if the cal- 
culated survival probability was lower than the threshold, and 
at low risk if the probability was higher than the threshold. The 
threshold was dynamically estimated with each model fitting. 

3. Statistical methods 

All statistical analyses were conducted with the statistical 
computing and graphic drawing language, R [8]. A joint 
model, that coupled the Cox proportional hazards and the 
mixed-effects models, was constructed based on a freely 
available package, JM [9], to assess longitudinal measure- 
ments of CA-125 and OS data. The Cox proportional hazards 
model explores the association between each patient's OS 
and longitudinal CA-125 profile [10], while the mixed-effects 
model, often used in longitudinal data analyses, models the 
time trend of CA-1 25 profiles [1 1 ]. The advantage of using the 
mixed-effects model is that it can handle patients' misaligned 
observations of CA-1 25 across time. The CA-125 profiles of the 
patients were smoothed using lowess (locally weighted scat- 
terplot smoothing) regression [12], which is a non-parametric 
regression method that uses locally-weighted polynomials to 
smooth the data. 

To test the accuracy of the prediction, we performed valida- 
tion based on 3-year survival probabilities. Two-hundred and 
eighteen patients were used in this part of the study after 
excluding 57 patients who were censored within 3 years. 
In order to have a well-fitted model, we unevenly divided 
these 218 patients by randomly selecting 50 patients as the 
validation cohort so that a majority (168 patients, or roughly 
3/4) were left to train the model and for an estimation of the 
optimal threshold of high or low risk stratification. Once the 
model was fitted and an optimal threshold was estimated, we 
applied it to the 50 patients in the validation cohort to calcu- 
late their 3-year survival probabilities. We determined that the 
prediction was accurate if a patient died within 3 years and 
was grouped as high risk, and vice versa. The validation course 
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was repeated 10 times, with 50 randomly selected patients 
each time. 



RESULTS 
1. CA-1 25 profiles 

A total of 1,601 CA-1 25 values were collected, representing 
all the available CA-1 25 measurements of the 275 patients 
from the time of diagnosis of ovarian cancer to the comple- 
tion of the primary chemotherapy. The cut-off date of follow- 
up was July 31 , 201 2. At the time, 81 patients had expired and 
194 had been censored. Patient and tumor characteristics are 
summarized in Table 1. CA-1 25 profiles of the patients were 
extremely heterogeneous, ranging from 2-14 measurements, 
spanning over 2-1 0 months. Some typical examples of the 
CA-1 25 profiles are shown in Fig. 1. 



Table 1 . Patient and tumor characteristics (n=275) 



Characteristic 


Value 


Age (yr) 


51 (13-81) 


Overall survival (mo) 


56.5 (0.8-214) 


FIGO stage 




IA 


41 (14.9) 


IB 


5(1.8) 


IC 


42(15.3) 


IIA 


2 (0.7) 


MB 


6 (2.2) 


IIC 


12(4.4) 


MIA 


1 1 (4.0) 


NIB 


7 (2.5) 


NIC 


135 (49.1) 


IV 


14(5.1) 


Histologic type 




Serous 


97 (35.3) 


Mucinous 


30(10.9) 


Endometrioid 


53(19.3) 


Clear cell 


39(14.2) 


Teratoma* 


8 (2.9) 


Mixed 


19(6.9) 


Other + 


29(10.5) 



Values are presented as median (range) or number (%). 

FIGO, International Federation of Gynecology and Obstetrics. 

*Six cases of malignant immature teratoma, 1 squamous cell tumor 

associated with teratoma, and 1 malignant melanoma arising from 

teratoma. + Mullerian tumor, granulosa cell tumor, sex cord tumor, 

transitional cell tumor, dysgerminoma, undifferentiated, and unknown 

types. 



2. The relationship between longitudinal CA-1 25 and OS 
according to FIGO stage and residual tumor size 

The joint model was adapted to evaluate the relationship 
between longitudinal CA-1 25 and OS according to FIGO stage 
and residual tumor size as covariates. Residual plots showed 
that the model fit the data very well (Fig. 2). It could flexibly 
fit all the various CA-1 25 profiles, regardless of the number of 
CA-1 25 measurements including early-censored patients who 
only had 2-3 measurements. All the collected CA-1 25 mea- 
surements (n=1 ,601 ) were fitted to the model, and the results 
showed that the longitudinal CA-1 25 values were significantly 
associated with OS (p<0.001). 

To further address the relationship between longitudinal 
CA-1 25 and OS, we plotted lowess curves (Fig. 3) for those 
patients who died within 3 years (the high-risk curve) and 
those who survived for at least 3 years (the low-risk curve). It 
is obvious that those patients at high-risk had a higher CA- 
125 curve. It is also interesting to note that the two curves 
are farthest apart not at the initial phase, but in the later time 
period post one month. This is in line with our suggestion that 
CA-1 25 changes over time have a significant predictive power. 

Stage and residual disease are both important prognostic 
factors of OS, and we included them as variables in the 
model to adjust for the possible confounding effect they may 
have on estimating OS with longitudinal CA-1 25. Stage was 
treated as a binomial variable labeled as early (FIGO I and II) 
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Fig. 1. Typical examples of patient-specific CA-1 25 profiles. Different 
patients had different numbers of CA-1 25 measurements taken at 
different time points spanning different time frames. The profile of 
patient A contained only 3 measurements in the first two months 
following admission. Patient B had 8 CA-1 25 values taken throughout 
the course from surgery to completion of the primary chemotherapy 
(approximately 6 months). Patient C had four CA-1 25 measurements 
in the first three months and her preoperative CA-1 25 value was 
missing. Patient D had three CA-1 25 observations throughout the 
course from surgery to completion of the primary chemotherapy 
(around 6 months). 
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Fitted values Cox-Snell residuals 

Fig. 2. Residual plots of the joint model. (A) The subject specific residual plot for the validity of the mixed-effects model. The red line is the 
lowess curve of the fitted values. It is very close to the horizontal line from point 0 (the dotted line), suggesting a good fit. (B) The Cox-Snell 
residual plot for the accuracy of the second part of the joint model. They axis is the fitted survival time (Kaplan Meier estimates). The solid black 
line is the Kaplan-Meier curve of the residuals, which corresponds to the ideal curve (the red line) quite well, suggesting a good fit of the model. 
The two dotted lines give the 95% confidence interval. 
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Fig. 3. The lowess curves of CA-1 25 profiles according to high and 
low risk patients at 3-year survival. Those who died within 3 years 
after surgery are considered high risk (n=41 ), and those who survived 
for at least 3 years are considered low risk (n=1 77). Patients who were 
censored within 3 years were excluded from the curves. All the CA- 
125 values of each entire group were locally smoothened to generate 
a curve that showed the overall distribution of CA-1 25 profiles. 



or advanced (FIGO III and IV) stage. Residual tumor size was 
also treated as a binomial variable with a cut-off value of 1 cm 
[7], Our model showed that they were both significantly as- 
sociated with OS (p=0.004 for stage, and p<0.001 for residual 
tumor size; data not shown). 

3. Prediction of OS and validation 

Based on the joint model described above, survival prob- 
ability could be predicted at any given point of time for a new 



patient, provided that the CA-1 25 profile, FIGO stage (early 
or late), and residual tumor size was available. The patient- 
specific prediction is flexibly available for new patients even 
those with only 2-3 measurements of CA-1 25 over any time 
frame during the course from diagnosis to the completion 
of primary chemotherapy, and the survival probability could 
be calculated with the model at any time point, i.e., 1, 2, 3, 4, 
5 years, and so on. The higher the probability, the lower the 
risk of death is at that time point. (For a program to calculate 
subject-specific survival probabilities, please go to http:// 
www.math.nsysu.edu.tw/~cchang/predict_survival/.) 
The average accuracy of the prediction was 85.4%, ranging 
from 80.0% to 88.0% (Table 2). These stable results showed 
that the model could give reliable prediction according to the 
high and low risk groups. 

4. Comparison with CA-1 25 pharmacokinetic factors 

The predicative value of the longitudinal CA-1 25 was com- 
pared with the CA-1 25 half-life and time-to-nadir predictive 
values as described previously [13]. Using the Cox propor- 
tional hazards model, the survival probability for each patient 
can also be calculated using CA-1 25 half-life or time-to-nadir 
with FIGO stage and residual tumor size as covariates. We 
ran the same validation course 10 times as described in the 
above section. The average accuracy of prediction with CA- 
125 half-life and time-to-nadir were both 82.2%. Compared 
to the average accuracy of 85.4% of longitudinal CA-1 25, the 
longitudinal CA-1 25 showed a better predictive value than a 
single kinetic factor (Table 3). 
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Table 2. Validation of the accuracy of predicting overall survival 
(n=50)* 





No. of validation 


Accuracy (%) 


1 84.0 


II 88.0 




III 


88.0 




IV 


84.0 




V 


84.0 




VI 


86.0 




VII 


86.0 


VIII 88.0 




IX 


80.0 




X 


86.0 




Average 


85.4 



Validation was done for 10 times. At each time, 50 patients were 
randomly selected as the validation cohort and the rest of the 
patients were used to fit the model and for estimation of the 
optimum threshold of high/low risk. Once the model was fitted, it 
was applied to the 50 patients to classify them into high or low risk of 
death at the 3-year time point. From 40 to 44 patients were correctly 
predicted to be at high or low risk in all the 10 rounds, which gave an 
average accuracy of prediction of 85.4%. 

DISCUSSION 

CA-1 25 is widely used as a tumor marker for the diagnosis 
and prognosis of ovarian cancer. However, the exact relation- 
ship between CA-1 25 and OS is still unclear [14,15]. Many of 
the related studies focus on specific values of CA-1 25. For 
example, studies on absolute CA-1 25 values have shown 
that the CA-1 25 levels after surgery, after 2, 3, or 6 cycles of 
chemotherapy are important prediction factors for OS [16- 
20], but the preoperative CA-1 25 may not correlate with OS 
[21]. The results presented here also suggested that those 
patients survived <3 years had an overall higher CA-1 25 curve 
compared to those who survived 3 years or more, even in the 
first month of treatment, which typically represents the CA- 
1 25 changes after the first couple of chemotherapy cycles. 

There have also been studies on changes of CA-1 25 levels as 
kinetic parameters such as half-life and doubling time of CA- 
1 25, time to reach CA-1 25 nadir, and CA-1 25 area under the 
curve (AUC) [1 3,22-24]. However, focus on single CA-1 25 values 
at specific time points may only reveal a limited relationship to 
OS. A notable feature of CA-1 25 is that it is measured repeat- 
edly over time. Here, a comparison of the predicative values 
of using CA-1 25 half-life and time-to-nadir with longitudinal 
CA-1 25 revealed that longitudinal CA-1 25 performed better in 
predicting OS. This suggests that longitudinal measurements 
contain valuable information that cannot be fully represented 
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Table 3. Comparison of the accuracy of predicting overall survival 
with longitudinal CA-1 25, CA-1 25 half-life, and CA-1 25 nadir (n=10)* 



CA-1 25 measurement 


Average accuracy (%) 


Longitudinal CA-1 25 


85.4 


CA-1 25 half-life 


82.2 


Time to reach CA-1 25 nadir 


82.2 



•Validation was performed as described previously and in Table 2. 
The accuracy presented in this table was the average result of 10 
times. Prediction with CA-1 25 half-life and time-to-nadir was per- 
formed with Cox proportional hazards model, while prediction with 
longitudinal CA-1 25 was with the joint model. International Federa- 
tion of Gynecology and Obstetrics (FIGO) stage and residual tumor 
size were included in each model as covariates. 



by examining a single value or a kinetic factor. In addition, 
studies that considered CA-1 25 over a period of time treated 
multiple CA-1 25 levels individually [20,25,26], and there is so 
far no report that has taken all the measurements and fitted 
them concurrently to a survival model. 

In contrast, here all available CA-1 25 measurements of pa- 
tients regardless of whether a patient has missing data or not 
were used. Furthermore, the model was constructed based 
on an assembly of patients regardless of factors such as tumor 
type, stage, and chemotherapeutic agent, which enabled the 
model to be applicable to a wide population without recruit- 
ment restrictions. This approach would be potentially helpful 
to gynecologic oncologists to make better informed decisions 
in patient counseling with all the available information. 

No other effort has achieved OS prediction with longitudinal 
CA-125, which has obvious medical applications. In the 
management of many diseases, there are important markers 
that are taken repeatedly over time to monitor prognosis, 
such as CA 1 9-9 in pancreatic cancer [27] and CA1 5-3 in breast 
cancer [28]. Our model provides a simple alternative to inves- 
tigating how OS is related to those longitudinally measured 
markers and is accessible online (http://www.math.nsysu.edu. 
tw/~cchang/predict_survival/). 

We included FIGO stage and residual tumor size as covariates 
in our model. Patients at an early stage (FIGO I and II) usually 
have a much better prognosis than those at an advanced 
stage (FIGO III and IV) [2], and their CA-125 profiles may also 
be quite different. Similarly, patients who had a small residual 
tumor size (<1 cm) after surgery had a better prognosis than 
those with larger residual tumor (>1 cm) [7]. Inclusion of these 
covariates could adjust for potential confounding effects, and 
the validation result confirmed that prediction with CA-125, 
stage and residual tumor size could reach a high accuracy. 

Another prominent feature of our model is that it could 
give prediction of a future patient's survival probability at 
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any time point with patient-specific CA-1 25 observations. 
To test the performance of our prediction, we performed 
validation based on 3-year survival probability. The accuracy 
in the multiple validation efforts was very stable, with an 
average accuracy of 85.4%, ranging from 80.0% to 88.0%. 
This result confirmed that longitudinal CA-1 25 could be used 
as a reliable prediction factor for OS. The fact that multiple 
validation based on random selections gave stable results 
suggests that this model could be applied to various patient 
populations. However, it is necessary to validate the model 
in a large, heterogeneous external population in the future. 
It should be noted that since we based the OS prediction 
on longitudinal CA-1 25, the survival probability of a patient, 
who does not show elevated levels of CA-1 25 throughout 
the course, might be overestimated in our model. About 20% 
ovarian cancer patients have low CA-1 25 concentration [29], 
Fitting the model with a low CA-1 25 patient pool could solve 
the potential problem, and this relies on acguisition of more 
patient information. 

In conclusion, longitudinal CA-1 25 measurements from the 
time of diagnosis to the completion of primary chemotherapy 
are significantly associated with and can reliably predict OS 
in patients with ovarian cancer. Future studies should be 
directed to explore other prognostic factors in their relation to 
OS, and the model should be tested in a large, external, and 
heterogeneous population. 
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